Semisupervised Clustering by Queries and Locally Encodable Source Coding

نویسندگان

چکیده

Source coding is the canonical problem of data compression in information theory. In a locally encodable source coding, each compressed bit depends on only few bits input. this paper, we show that recently popular model semisupervised clustering equivalent to locally encodable coding. model, task perform multiclass labeling unlabeled elements. At beginning, can ask parallel set simple queries an oracle who provides (possibly erroneous) binary answers queries. The cannot involve more than two (or fixed constant number of) Now all elements clustering) must be performed based noisy query answers. goal recover correct labelings while minimizing such equivalence codes leads us find lower bounds required variety scenarios. We provide querying schemes pairwise ‘same cluster’ - and AND queries, provable performance guarantees for schemes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semisupervised Clustering, AND-Queries and Locally Encodable Source Coding

Source coding is the canonical problem of data compression in information theory. In a locally encodable source coding, each compressed bit depends on only few bits of the input. In this paper, we show that a recently popular model of semisupervised clustering is equivalent to locally encodable source coding. In this model, the task is to perform multiclass labeling of unlabeled elements. At th...

متن کامل

Locally Decodable Source Coding

Source coding is accomplished via the mapping of consecutive source symbols (blocks) into code blocks of fixed or variable length. The fundamental limits in source coding introduces a tradeoff between the rate of compression and the fidelity of the recovery. However, in practical communication systems many issues such as computational complexity, memory capacity, and memory access requirements ...

متن کامل

Source-Optimized Clustering for Distributed Source Coding

Motivated by the design of low-complexity distributed quantizers and iterative decoding algorithms that leverage the correlation in the data picked up by a large-scale sensor network, we address the problem of finding correlation preserving clusters. To construct a factor graph describing the statistical dependencies between sensor measurements, we develop a hierarchical clustering algorithm th...

متن کامل

Feature Selection Algorithm for Supervised and Semisupervised Clustering

−In clustering process, semi-supervised learning is a tutorial of contrivance learning methods that make usage of both labeled and unlabeled data for training characteristically a trifling quantity of labeled data with a great quantity of unlabeled data. Semi-supervised learning cascades in the middle of unsupervised learning (without any labeled training data) and supervised learning (with com...

متن کامل

Supervised and Semisupervised Clustering Based on Feature Selection Algorithm Process

In clustering process, semi-supervised learning is a tutorial of contrivance learning methods that make usage of both labeled and unlabeled data for training characteristically a trifling quantity of labeled data with a great quantity of unlabeled data. Semi-supervised learning cascades in the middle of unsupervised learning (without any labeled training data) and supervised learning (with comp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Information Theory

سال: 2021

ISSN: ['0018-9448', '1557-9654']

DOI: https://doi.org/10.1109/tit.2020.3037533